
Feature/52 vla fine tuning #53

Merged
robertnishihara merged 1 commit into main from feature/52-vla-fine-tuning
Apr 7, 2026
Conversation

@shorbaji
Contributor

Add vla fine tuning template

@shorbaji shorbaji marked this pull request as ready for review March 27, 2026 13:49
@robertnishihara robertnishihara force-pushed the feature/52-vla-fine-tuning branch 3 times, most recently from 7c55ba2 to 530a0e1 Compare April 7, 2026 02:12
Fine-tunes the PI0.5 Vision-Language-Action model on a LeRobot robotics
dataset stored in S3, using Ray Data for CPU preprocessing and Ray Train
for distributed GPU training.

Key features:
- Streams LeRobot v3 (parquet + mp4) data from S3 via a custom Ray Data
  datasource with anonymous S3 access
- Preprocesses on CPU workers (rename cameras, HWC->CHW, /255 normalise)
  so GPU workers are never blocked on I/O or video decoding
- Expert-only fine-tuning: only the 4 action/time projection heads are
  trained; the PaliGemma backbone stays frozen
- BF16 mixed precision throughout (no GradScaler needed)
- Linear-warmup + cosine-decay LR schedule
- Fault-tolerant checkpointing via ray.train.report()
- Declarative, cloud-agnostic compute config targeting 8x L40S GPUs
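The CPU-side preprocessing described above (rename camera keys, HWC->CHW transpose, /255 normalisation) can be sketched roughly as follows. The camera key names and the per-row dict layout are assumptions for illustration, not the template's actual schema.

```python
import numpy as np

# Hypothetical mapping from LeRobot camera keys to the names the PI0.5
# policy expects -- the real template's key names may differ.
CAMERA_RENAMES = {
    "observation.images.top": "base_rgb",
    "observation.images.wrist": "wrist_rgb",
}

def preprocess_frame(row: dict) -> dict:
    """Rename camera keys and convert HWC uint8 images to CHW float32 in [0, 1]."""
    out = {}
    for src, dst in CAMERA_RENAMES.items():
        img = row.pop(src)                      # (H, W, C) uint8
        img = img.transpose(2, 0, 1)            # -> (C, H, W)
        out[dst] = img.astype(np.float32) / 255.0
    out.update(row)                             # pass through actions, state, etc.
    return out
```

In the template this kind of function would run on CPU workers, e.g. via a Ray Data `map` over the streamed dataset, so the GPU workers never spend time on decoding or normalisation.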
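The linear-warmup + cosine-decay LR schedule listed above can be written as a small pure-Python function; the step counts and peak learning rate below are illustrative, not the template's actual hyperparameters.

```python
import math

def lr_at_step(step: int, peak_lr: float, warmup_steps: int, total_steps: int) -> float:
    """Linear warmup from 0 to peak_lr, then cosine decay from peak_lr to 0."""
    if step < warmup_steps:
        # Linear ramp: reaches peak_lr on the last warmup step.
        return peak_lr * (step + 1) / warmup_steps
    # Cosine decay over the remaining steps: progress goes 0 -> 1.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return peak_lr * 0.5 * (1.0 + math.cos(math.pi * progress))
```

In a training loop this would typically be applied per optimizer step (e.g. by setting `param_group["lr"]` each iteration, or wrapped in a `torch.optim.lr_scheduler.LambdaLR`).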

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Robert Nishihara <rkn@anyscale.com>
@robertnishihara robertnishihara merged commit dca0393 into main Apr 7, 2026
@robertnishihara robertnishihara deleted the feature/52-vla-fine-tuning branch April 7, 2026 02:16
